Corpus: est_news_2008

3.7.3 Distribution of the string similarity for different rank ranges

Distribution of the Levenshtein distance for words in different rank ranges
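As a minimal sketch of how such a distribution could be tabulated: the source does not state which word pairs are compared or how the distances are capped, so the function names `levenshtein` and `distance_distribution` and the sample inputs below are illustrative assumptions, not the method actually used to produce these figures.

```python
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insertion, deletion, and substitution."""
    if len(a) < len(b):
        a, b = b, a  # iterate over the longer string, keep rows short
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def distance_distribution(distances):
    """Map each distance value to the percentage of words that have it."""
    counts = Counter(distances)
    total = sum(counts.values())
    return {d: 100.0 * n / total for d, n in sorted(counts.items())}


# Hypothetical word pairs, chosen only to demonstrate the two helpers.
pairs = [("maja", "maja"), ("maja", "maha"), ("kool", "kooli"), ("aasta", "asta")]
print(distance_distribution(levenshtein(x, y) for x, y in pairs))
```

Each table in this section has the same shape as the dictionary returned by `distance_distribution`: one row per distance value, with the percentages adding up to about 100.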

String similarity for top-1,000 words
Distance  Percentage of words
0         32.2981
1         41.6149
2         26.0870

String similarity for top-10,000 words
Distance  Percentage of words
0          6.5260
1         34.2013
2         59.2727

String similarity for top-100,000 words
Distance  Percentage of words
0          3.3368
1         25.0645
2         71.5987

String similarity for top-1,000,000 words
Distance  Percentage of words
0          3.2159
1         24.9039
2         71.8801

Computed in 422 ms on 2018-02-27 03:22.